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[57] ABSTRACT 

Method, system and computer program product are provided 
for adaptively encoding in hardware, software or a combi- 
nation thereof a sequence of video frames in real-time. A 
first encoding subsystem analyzes the sequence of video 
frames to derive information on at least one characteristic 
thereof, such as motion statistics, non-motion statistics, 
scene change statistics, or scene fade statistics. The gathered 
information may be either an intraframe characteristic or an 
interframe characteristic. A control processor is coupled to 
the first encoding subsystem to automatically analyze the 
gathered information in real time and dynamically produce 
a set of control parameters. A second encoding subsystem, 
coupled to the control processor, then encodes each frame of 
the sequence of video frames employing the corresponding 
set of control parameters. 

29 Claims, 9 Drawing Sheets 
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REAL-TIME ENCODING OF VIDEO 
SEQUENCE EMPLOYING TWO ENCODERS 
AND STATISTICAL ANALYSIS 

CROSS-REFERENCE TO RELATED 
APPLICATIONS 

This application comprises a continuation-in-part appli- 
cation of co-pending, commonly assigned U.S. patent appli- 
cation Ser. No. 08/948,442, filed Oct. 10, 1997, entitled 
"Adaptive Real-Time Encoding Of Video Sequence 
Employing Image Statistics," which is hereby incorporated 
herein by reference in its entirety. Further, this application is 
related to co-filed, commonly assigned U.S. patent applica- 
tion Ser. No. 09/008,282, entitled "Real-Time Variable Bit 
Rate Encoding Of Video Sequence Employing Image 
Statistics," which is hereby incorporated herein by reference 
in its entirety. 

TECHNICAL FIELD 

This invention relates in general to compression of digital 
visual images, and more particularly, to a technique for 
real-time encoding of a video sequence using image statis- 
tics derived from the video sequence to dynamically change 
one or more controllable encoding parameter(s) from frame 
to frame or within a frame. 

BACKGROUND OF THE INVENTION 

Within the past decade, the advent of world-wide elec- 
tronic communications systems has enhanced the way in 
which people can send and receive information. In 
particular, the capabilities of real-time video and audio 
systems have greatly improved in recent years. In order to 
provide services such as video-on-demand and video con- 
ferencing to subscribers, an enormous amount of network 
bandwidth is required. In fact, network bandwidth is often 
the main inhibitor in the effectiveness of such systems. 

In order to overcome the constraints imposed by 
networks, compression systems have emerged. These sys- 
tems reduce the amount of video and audio data which must 
be transmitted by removing redundancy in the picture 
sequence. At the receiving end, the picture sequence is 
uncompressed and may be displayed in real-time. 

One example of an emerging video compression standard 
is the Moving Picture Experts Group ("MPEG") standard. 
Within the MPEG standard, video compression is defined 
both within a given picture and between pictures. Video 
compression within a picture is accomplished by conversion 
of the digital image from the time domain to the frequency 
domain by a discrete cosine transform, quantization, and 
variable length coding. Video compression between pictures 
is accomplished via a process referred to as motion estima- 
tion and compensation, in which a motion vector plus 
difference data is used to describe the translation of a set of 
picture elements (pels) from one picture to another. 

The ISO MPEG -2 standard specifies only the syntax of 
bitstream and semantics of the decoding process. The choice 
of coding parameters and tradeoffs in performance versus 
complexity are left to the encoder developers. 

One aspect of the encoding process is compressing a 
digital video image into as small a bitstream as possible 
while still maintaining video detail and quality. The MPEG 
standard places limitations on the size of the bitstream, and 
requires that the encoder be able to perform the encoding 
process. Thus, simply optimizing the bit rate to maintain 
desired picture quality and detail can be difficult. 
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For example, a bit rate is defined in bits per second. Based 
on the frame rate and type of picture being encoded, a 
number of bits per picture is assigned. At 6,000,000 bits per 
second (6 Mbps), and pictures at 30 picture frames per 

5 second, each picture would be allocated 200,000 bits assum- 
ing that the bits are allocated uniformly. With a 720x480 
picture having 1350 macroblocks, this translates into 148 
bits allocated per macroblock. Thus, in the case of scene 
changes and action videos, the bit rate can be quickly 

10 consumed with drastic changes between macroblocks and/or 
between frames. Picture quality and detail can suffer as a 
result. 

Video compression requires advanced techniques to 
achieve the highest amount of compression, while still 

35 obtaining a desired picture quality. Variable bit rate (VBR) 
encoding is an option in video compression that allows for 
each compressed picture to have a different amount of bits 
based on complexity of intra and inter-picture characteris- 
tics. For example, scenes with simple picture content (such 

20 as a color test pattern) will require significantly less bits to 
encode than scenes with complicated picture content (such 
as a crowded city street) in order to obtain desired picture 
quality. Because of the amount of information that is needed 
to characterize the video and the complexity of the algo- 

25 rithms needed to interpret the information to effectively 
enhance the encoding process, VBR encoding is conven- 
tionally accomplished in a non-real-time, two or more pass 
encoding process. In a first pass, statistics are gathered and 
analyzed, and in a second pass, the results of the analysis are 

30 used to control the encoding process. Although this produces 
in high quality encoding, it does not allow for real-time 
operation. 

This invention seeks to enhance picture quality of an 
encoded video sequence while still obtaining a high com- 
35 pression rate by providing a real-time VBR a video encoding 
scheme. 

DISCLOSURE OF THE INVENTION 

40 Briefly summarized, this invention comprises in one 
aspect a method for encoding a sequence of frames. The 
method includes: employing a first encoding subsystem to 
analyze the sequence of video frames and derive information 
on at least one characteristic thereof, the at least one 
45 characteristic being at least one of scene change, picture 
quality, bits used, target bit rate, and picture type; automati- 
cally processing the at least one characteristic in real-time to 
produce a value for at least one controllable parameter used 
in encoding the sequence of video frames; and encoding the 
50 sequence of video frames employing a second encoding 
subsystem and using the value of the at least one controllable 
parameter to produce a bitstream of encoded video data. 

In another aspect, the invention comprises a system for 
encoding a sequence of video frames. The system includes 
55 a first encoding subsystem, a control processor and a second 
encoding subsystem. The first encoding subsystem is 
employed to analyze the sequence of video frames and 
derive information on at least one characteristic thereof. The 
at least one characteristic comprises at least one of scene 
60 change, picture quality, bits used, target bit rate, and picture 
type. The control processor is coupled to the first encoding 
subsystem for automatically processing in real-time the 
information on the at least one characteristic to produce a 
value for at least one controllable parameter used in encod- 
es ing the sequence of video frames. The second encoding 
subsystem is coupled to the control processor for receiving 
the value of the at least one controllable parameter. The 
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second encoding subsystem encodes each frame of the block in a subsequent or previous frame or picture. Elements 

sequence of video frames using the corresponding value of 211 and 211' represent the same location in both pictures, 

the at least one controllable parameter, thereby producing a FIG. 4 illustrates the movement of blocks in accordance 

bitstream of encoded video data. with the motion vectors from their position in a previous 

In a further aspect, the invention comprises a computer 5 picture to a new picture, and the previous picture's blocks 

program product including a computer usable medium hav- adjusted after using motion vectors, 

ing computer readable program code means therein for use FIG. 5 shows a flow diagram of an encoding system 300 

in encoding a sequence of video frames. The computer employing a first encoding subsystem El and a second 

readable program code means in the computer program encoding subsystem E2 in accordance with the principles of 

product includes computer readable program code means for 30 the present invention. Subsystem El is configured to derive 

causing a computer to affect: analyzing the sequence of statistics on one or more characteristics of a sequence of 

video frames and deriving information on at least one frames to be encoded. These characteristics are employed by 

characteristic thereof, the at least one characteristic com- subsystem E2 to adaptively encode the sequence of frames 

prising at least one of scene change, picture quality, bits to optimize picture quality and/or encoding performance, 

used, target bit rate, and picture type; automatically process- 15 FIG. 6 is a generalized diagram of an encoding subsystem 

ing the at least one characteristic to produce a value for at in accordance with the present invention. Subsystem El is 

least one controllable parameter used in encoding the employed to generate, for example, non -motion statistics 

sequence of video frames; and encoding the sequence of and inter/intra motion statistics using non-motion statistics 

video frames using the value of the at least one controllable gathering 420 and encoding engine 410, respectively, while 

parameter to produce a bitstream of encoded video data. 20 subsystem E2 generates the encoded bitstream using encod- 

In general, encoding in accordance with the principles of ing engine 410. 
the present invention results in improved picture quality FIG. 7 is a block diagram of an alternate embodiment of 
compared with non-adaptive encoder systems, especially at an encoding system 500 in accordance with the principles of 
low bit rates. This is because, for example, employing this invention. System 500 employs a first encoding sub- 
adaptive bit allocation among frames (as well as within 25 system El 510 and a second encoding subsystem E2 540 
frames) is more critical in low bit rate encoding compared coupled via a controlling processor 520. Subsystem El is 
with higher bit rate encoding. The present invention com- configured to derive statistics on one or more characteristics 
prises a real-time video data encoding approach which of a sequence of frames to be encoded. These characteristics 
employs two encoders and real-time statistical processing. are statistically analyzed by controlling processor 520 to 
The statistical processing is accomplished within a processor 30 dynamically develop a value for one or more controllable 
coupled between the first encoder and the second encoder to parameters to be used by subsystem E2 in encoding a current 
analyze the statistics produced by the first encoder and to frame of the sequence of frames, thereby optimizing picture 
develop encoding parameters for the second encoder. The quality and/or encoding performance, 
second encoder then uses the enhanced encoding parameters piG. 8 is a generalized flow diagram pursuant to the 
to provide a high quality, highly compressed video stream. 35 present invention, wherein a sequence of video data is input 
The MPEG-2 Standard is assumed; however, the concepts t0 subsystem El, which produces information on scene 
presented herein can apply to other standards as well. change, picture quality, bits used, picture type and/or target 
Further, the encoding technique of this invention can insure hit rate. This information is forwarded to controlling pro= 
a semi-constant picture quality of a decoded video sequence ceS sor (CP) 520, which in this example provides a quanti- 
in constant bit rate (CBR) mode or a constant picture quality 40 za tion parameter (Mquant) to the second encoding sub- 
in variable bit rate (VBR) encoding mode. system E2 540. Subsystem 540 also receives as input a 
BRIEF DESCRIPTION OF THE DRAWINGS synchronized, delayed version of the sequence of video data. 

Hie above-described objects, advantages and features of FIG * 9 15 a ^ c ^ r [ of one embodiment of statistical 

the present invention, as well as others, will be more readily 45 P roccssm g performed by controlling processor (CP) in 

understood from the following detailed description of cer- deve l°P in g °°f or more parameters for use in controlling 

tain preferred embodiments of the invention, when consid- enco £ n S of K the sec £ ence of Vlde0 dala b > the 

ered in conjunction with the accompanying drawings in encoding subsystem E2. 

which: BEST MODE FOR CARRYING OUT THE 

FIG. 1 shows a flow diagram of a generalized MPEG-2 50 INVENTION 
compliant encoder 11, including a discrete cosine trans- Th e invention relates, for example, to MPEG compliant 
former 21, a quantizer 23, a variable length coder 25, an encoders and encoding processes such as described in 
inverse quantizer 29, an inverse discrete cosine transformer "Information Technology-Generic coding of moving pic- 
31, motion compensation 41, frame memory 42, and motion lures an d associated audio information: Video," Recommen- 
estimation 43. The data paths include the i th picture input 55 d ation ITU-T H.262, ISO/IEC 13818-2, Draft International 
111, difference data 112, motion vectors 113 (to motion Standard, 1994. The encoding functions performed by the 
compensation 41 and to variable length coder 25), the encoder include data input, spatial compression, motion 
picture output 121, the feedback picture for motion estima- estimation, macroblock type generation, data reconstruction, 
tion and compensation 131, and the motion compensated entropy coding, and data output. Spatial compression 
picture 101. This figure has the assumptions that the i th 60 includes discrete cosine transformation (DCT), quantization, 
picture exists in frame memory or frame store 42 and that the a nd entropy encoding. Temporal compression includes 
i+1' is being encoded with motion estimation. intensive reconstructive processing, such as inverse discrete 
FIG. 2 illustrates the I, P, and B pictures, examples of their cosine transformation, inverse quantization, and motion 
display and transmission orders, and forward, and backward compensation. Motion estimation and compensation are 
motion prediction. 65 use d f or temporal compression functions. Spatial and tem- 
FIG. 3 illustrates the search from the motion estimation poral compression are repetitive functions with high corn- 
block in the current frame or picture to the best matching putational requirements. 
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More particularly the invention relates, for example, to a picture display order. See FIG. 2. For I-P-B systems the 

process for performing spatial and temporal compression input picture transmission order is different from the encod- 

including discrete cosine transformation, quantization, ing order, and the input pictures must be temporarily stored 

entropy encoding, motion estimation, motion compensation, until used for encoding. A buffer stores this input until it is 

and prediction, and even more particularly to a system for 5 used. 

accomplishing spatial and temporal compression. For purposes of illustration, a generalized flowchart of 

The first compression step is the elimination of spatial MPEG compliant encoding is shown in FIG. 1 In the 

redundancy, for example, the elimination of spatial redun- flowchart tfa e images of the ithpicture and the i+l f picture 

dancy in a still picture of an "I" frame picture. Spatial f re Passed to generate motion vectors. The motion vec- 

redundancy is the redundancy within a picture. The MPEG-2 io Precto w here a macroblock of pixels wiU be in a prior 

c* a a • • ui i u / *u a c a • and/or subsequent picture. The use of the motion vectors is 

Draft Standard is using a block based method of reducing a k 4 Qf fc £ ^ ession in the MPEG stan . 

spatial redundancy. The method of choice is the discrete dard M shown x \ he motion vectors> once 

cosine transformation, and discrete cosine transform coding generated, are used for the translation of the macroblocks of 

of the picture. Discrete cosine transform coding is combined pixelSj from the p pi cwr e to the picture, 

with weighted scalar quantization and run length coding to is M SQ0WQ m nG t {q tfae encoding process> the images 

achieve desirable compression. of the p picture and the picture are processed m the 

The discrete cosine transformation is an orthogonal trans- encoder 11 to generate motion vectors which are the form in 

formation. Orthogonal transformations, because they have a which, for example, the and subsequent pictures are 

frequency domain interpretation, are filter bank oriented. encoded and transmitted. An input image 111 of a subse- 

The discrete cosine transformation is also localized. That is, 20 quent picture goes to the motion estimation unit 43 of the 

the encoding process samples on an 8x8 spatial window encoder. Motion vectors 113 arc formed as the output of the 

which is sufficient to compute 64 transform coefficients or motion estimation unit 43. These vectors are used by the 

sub-bands, motion compensation Unit 41 to retrieve macroblock data 

Another advantage of the discrete cosine transformation is previous and/or future pictures, referred to as "refer- 
that fast encoding and decoding algorithms are available. 25 ence data for outpu by m K unit One output of the motion 
ajj *- « .u u u j j c ii_ j ■ compensation Unit 41 is negatively summed with the output 
Additionally the sub-band decomposition of the discrete motkm estimation * unit ^ and t0 the { ? of 

cosine transformation is sufficiently well behaved to allow ±Q Discrete Cosine Xransformer 21 % Q out t £ the 

effective use of psychovisual criteria. discrete CQsine transformer 21 ^ quantize d in a quantizer 23. 

After transformation, many of the frequency coefficients 3Q Th e output of the quantizer 23 is split into two outputs, 121 

are zero, especially the coefficients for high spatial frequen- and 131; one output m goes to a downstream element 25 

cies. These coefficients are organized into a zig-zag or f or further compression and processing before transmission, 

alternate-scanned pattern, and converted into run-amplitude sucn as t0 a mn i erjgtD encoder; the other output 131 goes 

(run-level) pairs. Each pair indicates the number of zero through reconstruction of the encoded macroblock of pixels 

coefficients and the amplitude of the non-zero coefficient. 35 f or storage in frame memory 42. In the encoder shown for 

This is coded in a variable length code. purposes of illustration, this second output 131 goes through 

Motion compensation is used to reduce or even eliminate an inverse quantization 29 and an inverse discrete cosine 

redundancy between pictures. Motion compensation transform 31 to return a lossy version of the difference 

exploits temporal redundancy by dividing the current picture macroblock. This data is summed with the output of the 

into blocks, for example, macroblocks, and then searching in 40 motion compensation unit 41 and returns a lossy version of 

previously transmitted pictures for a nearby block with the original picture to the frame memory 42. 

similar content. Only the difference between the current As shown in FIG. 2, there are three types of pictures, 

block pels and the predicted block pels extracted from the There are "Intra pictures" or "I" pictures which are encoded 

reference picture is actually compressed for transmission and transmitted whole, and do not require motion vectors to 

and thereafter transmitted. 45 be defined. These "I" pictures serve as a reference image for 

The simplest method of motion compensation and pre- motion estimation. There are "Predicted pictures" or "P" 

diction is to record the luminance and chrominance, i.e., pictures which are formed by motion vectors from a previ- 

intensity and color, of every pixel in an "I" picture, then ous picture and can serve as a reference image for motion 

record changes of luminance and chrominance, i.e., intensity estimation for further pictures. Finally, there are "Bidirec- 

and color for every specific pixel in the subsequent picture. 50 tional pictures" or "B" pictures which are formed using 

However, this is uneconomical in transmission medium motion vectors from two other pictures, one past and one 

bandwidth, memory, processor capacity, and processing future, and can not serve as a reference image for motion 

time because objects move between pictures, that is, pixel estimation. Motion vectors are generated from "I" and "P" 

contents move from one location in one picture to a different pictures, and are used to form "P" and "B" pictures, 

location in a subsequent picture. A more advanced idea is to 55 One method by which motion estimation is carried out, 

use a previous or subsequent picture to predict where a block shown in FIG. 3, is by a search from a macroblock 211 of 

of pixels will be in a subsequent or previous picture or an I th picture throughout a region of the next picture to find 

pictures, for example, with motion vectors, and to write the the best match macroblock 213. Translating the macroblocks 

result as "predicted pictures" or "P" pictures. More in this way yields a pattern of macroblocks for the i+l th 

particularly, this involves making a best estimate or predic- eo picture, as shown in FIG. 4. In this way the i' h picture is 

tion of where the pixels or macroblocks of pixels of the i th changed a small amount, e.g., by motion vectors and dif- 

picture will be in the i-l th or i+l r/ * picture. It is one step ference data, to generate the i+l'* picture. What is encoded 

further to use both subsequent and previous pictures to are the motion vectors and difference data, and not the i+l" 1 

predict where a block of pixels will be in an intermediate or picture itself. Motion vectors translate position of an image 

"B" picture. 65 fj 0m p i c ture to picture, while difference data carries changes 

To be noted is that the picture encoding order and the in chrominance, luminance, and saturation, that is, changes 

picture transmission order do not necessarily match the in shading and illumination. 



07/23/2004, EAST Version: 1.4.1 



5,978,029 



8 



Returning to FIG. 3, we look for a good match by starting 
from the same location in the i th picture as in the i+1 
picture. A search window is created in the X th picture. We 
search for a best match within this search window. Once 
found, the best match motion vectors for the macroblock are 
coded. The coding of the best match macroblock includes a 
motion vector, that is, how many pixels in the y direction and 
how many pixels in the x direction is the best match 
displaced in the next picture. Also encoded is difference 
data, also referred to as the "prediction error", which is the 
difference in chrominance and luminance between the cur- 
rent macroblock and the best match reference macroblock. 

The operational functions of an MPEG -2 encoder are 
discussed in detail in commonly assigned, co-pending U.S. 
patent application Ser. No. 08/831,157, by Carr et al., filed 
Apr. 1, 1997, entitled "Control Scheme For Shared-Use 
Dual-Port Predicted Error Array," which is hereby incorpo- 
rated herein by reference in its entirety. 

As noted initially, encoder performance and/or picture 
quality may be enhanced in accordance with the principles 
of this invention through real-time adaptive video encoding. 
The video encoder is constructed to be adaptive to the video 
data received as a sequence of frames. In accordance with 
one embodiment of this invention, two encoding subsystems 
are employed. A significant advantage of using two encod- 
ing subsystems is the ability to analyze the video sequence 
prior to its real-time encoding. Analysis of the video 
sequence comprises calculating one or more statistics which 
can be derived from the video data. 

The statistical measures can describe different character- 
istics of an image frame, for example, busyness of a frame, 
motion between image frames, scene change or fading, etc. 
Using the calculated statistics, adaptive encoding of the 
video sequence is then carried out by controlling one or 
more encoding parameters of the real-time encoding pro- 
cess. For example, bit allocation, quantization parameter(s), 
encoding mode, etc., can be changed from frame to frame or 
macroblock to macroblock within a given frame according 
to derived statistics of a characteristic (e.g., scene content) 
of the particular firame(s). 

One embodiment of an encoding system, generally 
denoted 300, in accordance with the principles of this 
invention is depicted in FIG. 5. The MPEG Standard is again 
assumed herein for purposes of explanation; however, those 
skilled in the art will understand that other implementations 
and standards can employ the adaptive encoding concepts of 
this invention. System 300 includes two encoder 
subsystems, designated El 330 and E2 360. In one 
implementation, encoder subsystems El and E2 are assumed 
to have identical hardware, but different software as 
described hereinbelow. El is programmed to generate the 
desired statistics, such as interframe/intraframe non-motion, 
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motion, etc. statistics, which are important to the encoding 
subsystem's (E2) specific bit rate control algorithm. E2 
generates .encoded frames based on the statistics generated 
by encoding subsystem El. 

Operationally, a sequence of video frames 310 is initially 
received into a frame store 320, where one or more frames 
are buffered depending upon the encoding specification 
(e.g., I, IP, IBP, IBBP encoding). This is accomplished by 
partitioning frame store 320 into an appropriate number of 
picture buffers (determined by group of picture (GOP) 
structure). These partitions are managed by a delay control 
logic 350. After sufficient delay, again determined by 
implementation, the video frame information is passed to 
encoder subsystem El 330, which derives the information 
on image statistics and stores this information in a statistics 
buffer 340 on a frame -by-frame basis. The delay control 
hardware 350 manages buffering of incoming video data and 
of image statistics, and feeds the video frames from frame 
store 320, as well as the derived statistics from statistics 
buffering 340, to encoding subsystem E2 360 in encode 
order. Using these statistics, subsystem E2 adaptively 
encodes the frames as described further below and outputs 
the encoded bitstream 370 in real time, delayed only by 
sufficient frame time to allow encoding subsystem El to 
generate the statistics on one or more characteristics of the 
received video input 310. 

A block diagram of a generalized encoding subsystem, 
denoted 400, is depicted in FIG. 6. System 400 comprises 
hardware/software for calculating non-motion statistics 420 
on the video data as it is received in the encoding subsystem, 
as well as an encoding engine 410 which consists of 
hardware/software to perform the actual video compression, 
i.e., motion estimation, motion compensation, quantization, 
variable length coding, etc. Encoding subsystem El 330 
(FIG. 5) will employ both statistics gathering logic 420 and 
encoding engine 410, while encoding subsystem E2 360 
(FIG. 5) will only employ encoding engine 410. Thus, 
during a first pass of encoding, i.e., via subsystem El, 
motion statistics based on motion vectors are calculated by 
encoding engine 410. Encoding subsystem E2 then outputs 
an encoded bitstream using a second pass through encoding 
engine 410. 

Real time operation, and associated frame delays of a 
system in accordance with this invention, is demonstrated in 
the MPEG -2 example of Table 1. In this example, one B 
picture is assumed between two anchor pictures (IBPBPBP. 
. .) and non- motion statistics are being collected. Only one 
frame of video data is buffered before the statistical 
calculation, and the delay between input and output of the 
frames is a maximum of four frame times in the example. 
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Other implementations will be apparent to those skilled in frames. DFD is the difference between a pixel value at 

the art employing the principles of the present invention. For the current frame and the corresponding (displaced by 

example, video data could be input into a frame store in an estimated motion vector) pixel value in the previous 

parallel with inputting it into the first encoding subsystem or future frame. Variance can be estimated as statistical 

El if the video data has no B pictures and the statistics are 5 average of DFDs, which are calculated for each pixel of 

intra frame. the entire frame. 

Examples of statistics calculated in encoding subsystem Local Quantities: 

El are next described. Variance of DFD for macroblock. 

As noted, encoding subsystem El calculates statistics Same as the global quantity, but the variance is esti- 

from the image data. Based upon these statistics, the sub- 10 mated for a macroblock. 

system can also carry out pre-processing steps, such as Variance of motion vectors (difference between neighbor- 
identifying scene change or fade detection. The particular ing motion vectors) 
statistics calculated by subsystem El depend on the given Scene Change Detection: 

implementation of a rate control algorithm within subsystem Consider two consecutive frames. The above statistics 

E2. In MPEG-2 encoding, there is a wide range of picture 15 (global and local) are available for each frame. There are 

statistics that can be used to determine the quantization for different possibilities to detect scene changes; for example: 

a frame or within a frame. The statistics discussed herein- a ) If ((AVACT(i)-AVACT(i-l))>thresholdl), then frame i 

below are provided by way of example only and other E2 belongs to the new scene. Thresholdl is determined 

video compression algorithms may employ different picture experimentally. 

quantities ' 20 b) If ((AID(i)-AID(i-l))>threshold2) then frame i 

Frame statistics can generally be divided into two groups, belongs to the new scene. Threshold2 is determined 

i.e., intraframe and interframe statistics. Intraframe statistics experimentally, 

are calculated using only pixel data within a frame, while c) Combination of conditions a) and b). 

interframe statistics are obtained using several consecutive d) [f ((AFD)>threshold3) , then frame { 5el t0 the new 

images irom an image sequence (generally two consecutive ^n. L u-. • 1 • i . „ 

• \ y c j * c + * ■ ** , . scene. ThresholdJ is determined experimentally, 

images). Intraframe and interframe statistics can be further x „ , , v . ' 

divided into global and local quantities. Global quantities e > Sce . nc chan f detection-interfrarne statistics. If (DFD 

describe characteristics of entire image frames, and local vanance>threshold4) and if ((AID(i)-AID (i-l)) 

statistical values are calculated for each subdivision of a * te s h old2), th f D frame ; belongs t0 ? e 5! w scene " 

frame, e.g., for each macroblock of a frame. The statistics 30 Thresholds are determined experimental^. Threshold4 

presented herein can be calculated from luminance data, has to be ch ° s ^ c r arefuU ^ b ~ a ^ ^ 0r \ e motlon 

however, additional statistics can be derived from chromi- vector * a ™ lable for a macroblock (MB) and it is not 

nance data as well necessarily the true motion vector for each pixel in the 

Intraframe Statistics: f _ MB * . J j L . .„ . 

Global Quantities' 35 a cnan S e ^ detected, this will be communicated 

a . t .' , ✓ * * * * - t0 system E2. Subsystem E2 may react by disregarding any 

Average Interpixel Difference (AID) of a frame — AID is • c * 4 l • «. /■ . . f r A , 

& r i_ i * j-ir L* t mformation from the previous picture which belongs to the 

the average of absolute differences between two con- m • ™ i * L j* j 

. , . , . , . r ^ ,™ . . . previous scenes, E2 can also rearrange the encoding modes 

sccuuve pixcis in ine image lines or a rxame. ine nigner * * «u « ♦ • . X * u i * 

• jl jr\ i t_ ■ l * +i . c „_ . & . or the pictures: e.g., the first picture that belongs to the new 

the AID, the higher the details in a frame. This quantity An . „ ? A , . . 

. , - , . <. . / 40 scene may be coded as an I picture, 

is calculated by hardware m the pixel interface as pixels p a( ^ e rj e tg C (i orr 

for the frame pass through. ^ ^ ac j e ^ 5 as j ca jjy a s j ow scene change, whereby the 
Average Activity (AVACT) of a frame— AVACT is esti- frames change g^aHy, m contrast to a true scene cut 
mated as the average of the macroblock variances in a where the change ^ abmpt A fade has tWQ directionSj 1} the 
frame. A macroblock variance can be estimated as 45 first dissolving into the fade, and 2) the fade crystal- 
statistical averages from the pixel values of a macrob- Uzing mt0 the second scenc> 

lock. The average activity gives information about the ^ pre sence of a fade and its direction may be deter- 

busyness of a picture. mined by a percentage of interpixel sums delta with respect 

joca. quantities: - l0 a p rev ious frame, i.e., if the value of the sum of the pixels 

During the calculation of the above global quantities, 50 of frame N+1 differs by an experimentally determined 

local measures can also be obtained. Thus, for each percentage from frame N, then a fade is occurring. Its 

subdivision, e.g. macroblock (MB), the MB -AID and direction is determined by the sign of the magnitude, i.e., 

MB-AVACT statistics can be stored and then used by being greater or , ess than mrQ lf a fade is detected) lhis is 

subsystem E2 for local adaptive encoding of a frame. communicated to E2 

Interframe Statistics: 55 ^ noted ab ove, encoder subsystem E2 can have the same 

These statistics describe the relationship between con- encoding engine architecture as encoder subsystem El, 

secutive frames of an image sequence, e.g. occurrence of however, the statistics gathering hardware/software would 

motion, scene change, fading, or identifying noise in a not be employed by subsystem E2. Adaptive encoding of a 

mac ^ obl ? ck - . . frame sequence is carried out by the rate control algorithm 

Global Quantities: 60 of the encodmg engine, i.e., using the above-generated 

Average Frame Difference (AFD) of a frame — AFD is the statistics. This is a 2-step process. 

average of absolute differences between the luminance At first, bit allocation is defined for each picture depend- 

pixel value of the current picture and the pixel at the mg 0 n the bit rate, encoding mode and the relative charac- 

same location in the previous frame. teristics of the frames to each other. Then, a corresponding 

Variance of the DFD (Displaced Frame Difference)— To 65 quantization parameter (QUANT) is defined. In an MPEG-2 

obtain DFD at a pixel, motion vectors have to be compatible bitstream, the QUANT value can change from 

calculated for each macroblock using two consecutive macroblock to macroblock allowing locally adaptive quan- 
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tization inside a frame. In accordance with this invention, a quality, highly compressed video stream 541 in real time, 

first global QUANT value is defined for each picture using Encoding subsystem E2 also receives a delayed 530 version 

the global quantities described above. The QUANT value of the input video data 501 which is synchronized to the 

for a particular macroblock is then obtained by modulating enhanced encoding parameter(s) received from controlling 

the global QUANT value based on the local statistics of the 5 processor 520 so that subsystem E2 encodes each frame 

macroblock. using its associated enhanced encoding parameters) from 

By detecting scene change within encoding subsystem the controlling processor. 

El, and with a prior knowledge of picture statistics, after The statistics produced by encoding subsystem El can be 

detection of a scene change information from previous based on information provided with the picture (intra-frame) 

pictures which belongs to the previous scene can be dis- 10 or information collected based on differences between mul- 

carded. For example, a new group of pictures (GOP) can be tiple pictures (inter- frame) as discussed above. Examples of 

started with the new scene. The global QUANT values can intra-frame statistics are: (1) picture complexity; (2) bit 

be calculated for the frames of the new GOP using pre- count; and (3) signal-to-noise ratio (SNR). Examples of 

defined initial rate control parameters instead of using inter-frame statistics include: (1) motion vectors; (2) motion 

parameters from the old scene, 35 detection; (3) repeat fields; (4) prediction error; and (5) 

If a fade is detected by subsystem El, then subsystem E2 scene changes. Techniques which can be employed for 

may react by using the proper reference frames for motion determining these statistics will be understood by those 

estimation/compensation, and/or change the encoding mode skilled in the art. 

of frames. An example of this may be forcing an I picture, The controlling processor employs the statistics from 

or coding all macrob locks in a P or B picture as intra- 20 encoding subsystem El to perform a statistical analysis 

macroblocks. routine to generate one or more parameters to control the 

Locally adaptive quantization is also possible. The aim in encoding process of the second encoding subsystem E2. 
this approach would be to distribute allocated bits among the Examples of controllable parameters include: bits per pic- 
macroblocks based on scene content. One possible approach ture; picture type; notification of repeat fields; field or frame 
would be to employ AVACT of the current picture to be 25 encoding; maximum amount of motion between frames 
encoded. The MB-AVACT for each macroblock are also (search window); and/or a quantization parameter (Mquant). 
stored in the statistics buffering. A QUANT value of an MB FIG. 8 depicts one embodiment of a processing routine 
is obtained by modulating the global QUANT value by the wherein the sequence of video data is analyzed within 
ratio of the AVACT and the MB-AVACT. For example, if the encoding subsystem El 510 to determine whether a scene 
macroblock activity is high with respect to the picture 30 change has occurred, and to provide a measure of picture 
AVACT, then the QUANT of this particular macroblock will quality. These statistics (along with possibly bits used, 
be increased with respect to the global QUANT, and vice picture type and target bit rate) are provided to the control- 
versa, ling processor (CP) 520 which in this example, develops 

Pursuant to the adaptive encoding system of this therefrom a quantization parameter (Mquant) that is pro- 
invention, the local statistics can also be employed to 35 vided to the second encoding subsystem E2 540. Encoding 
identify noisy macroblocks within a picture. If a macroblock subsystem E2 receives a delayed version of the sequence of 
has a high DFD value and its activity is also high, the video data used by encoding subsystem El and produces 
macroblock can be declared noisy. A noisy macroblock can therefrom the encoded video bitstream. 
then be encoded by a high QUANT value, which results in FIG. 9 depicts an example of a statistical technique or 
bit savings. These extra bits can be used for encoding 40 processing routine implemented within controlling proces- 
noiseless macroblocks within the frame. Consequently, the sor 520. The main function of the controlling processor in 
overall quality of the picture will be improved. this example is to determine a new set of encoder parameters 

One example of a manner in which the encoding system for a new picture based on statistics of the last encoded 

of this invention performs the function of locally adaptive picture and the history of previously encoded pictures within 

quantization may be the following. To determine the quan- 45 a given scene. The new set of encoder parameters allows the 

tization value for each macroblock, the encoding subsystem second encoding subsystem to produce a constant quality 

would receive the MB-AVACT and DFD from the statistics picture with variable output bit rate, 

store 340 (FIG. 5) for that macroblock. Subsystem E2 would The input video data to encoding subsystem E2 is iden- 

then use these values in determining the most efficient tical to that to encoding subsystem El, with the exception of 

QUANT value for the macroblock. 50 a delay 530. The delay allows encoding subsystem El to 

Further aspects of an encoding system/method in accor- process, for example, two pictures to initially generate an 

dance with the principles of the present invention are next average value for one or more characteristics of the pictures, 

described below with references to FIGS. 7-9. In this This delay also allows the controlling processor time to 

enhanced embodiment, the invention comprises real-time analyze the statistics from encoding subsystem El and 

encoding of video data using two encoders and a controlling 55 generate a set of new parameters for the current picture to be 

processor. As shown in FIG. 7, a first encoder, El 510 processed within encoding subsystem E2. 

receives input video data 501 and generates statistics there- The controlling processor preferably collects statistics on 

from for forwarding to a controlling processor (CP) 520. every picture processed within encoding subsystem El. This 

More particularly, encoding subsystem El receives the collection of statistics 600 includes in one embodiment 

uncompressed video data and derives statistics describing 60 scene change and picture quality statistics. Other statistics 

the current picture and its relation to other pictures in the which might be considered include bits used, picture type, 

sequence of video data, target bit rate, average picture Mquant, etc. In the routine of 

The controlling processor 520 then uses statistical tech- FIG. 9, the controlling processor first identifies if a given 

niques (described below) to analyze the generated statistics picture has content similar to a last picture, or if the current 

to develop one or more encoding parameters for use by 65 picture belongs to a new scene 610. If the current picture 

encoding subsystem E2 540. Encoding subsystem E2 uses does not belong to the same scene, then encoding parameters 

the enhanced encoding parameters) to produce a high are reset based on picture complexity of the current picture 
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620. These reset encoding parameters are provided to a 
buffer 660 to await output to and use by the second encoding 
subsystem 540 as described above. 

If the current picture's content is similar to that of the 
previous picture, i.e., there is no scene change, then the 
encoding parameters can be based on information from the 
previous pictures of the same type in the scene. For example, 
the routine of FIG. 9 employs "picture quality" to define one 
or more parameters. The picture quality indicates whether 
£xcessive bits were used by the first encoding subsystem El 
TcTencode the current picture. (The encoding parameters 
used by subsystem El could be those used by subsystem E2 
to encode the previous frame or they might comprise a 
reference set of parameters). 

A picture quality threshold is predefined and can be 
derived experimentally by one skilled in the art. For 
example, the threshold can be defined based on a desired bit 
rate and picture content. A method of measuring picture 
quality could be determining a signal-to-noise ratio. A 
typical reasonable signal-to-noise ratio is in the range of 30 
to 40 db. In one embodiment, "picture quality" may be 
determined by comparing an encoded and decoded version 
of the frame to the original version of the frame input to the 
first encoding subsystem El. The controlling processor 
initially determines whether the current picture quality (i.e., 
output quality from El) exceeds the predefined threshold 
630. If the threshold is exceeded, the amount of bits spenj pjp 
the current picture can be c onserved for more complex 
pictures in the sequencTTJfviSeo data. 

By way of example, the controlling processor collects and 
accumulates picture statistics such as average Mquant, aver- 
age quality, and average bits used by pictures of a given 
scene. These statistics on a newly El encoded picture of a 
current scene are compared with average scene statistics to 
determine how the coding parameters of the current picture 
should be refined. The controlling processor thus learns from 
the scene's history and adjusts the controlling parameters of 
the current picture for use by the second encoding subsystem 
E2. 

The amount of adjustment is related in this example to the 
most recent Mquant (herein "previous Mq") and the average 
Mquant (herein "Av Mq") of the same picture types. When 
the quality is higher than target quality threshold, the 
Mquant must be increased to lower bit consumption. The 
opposite applies when the quality is lower than target quality 
threshold. The amount of increase or decrease is the differ- 
ence between the updated average Mquant and the previous 
average Mquant. This increment provides a convergence 
towards the target quality threshold. 

Thus, when the picture quality exceeds the predefined 
threshold, the current Mquant to be employed by the second 
encoding subsystem (i.e., "next Mq") is defined as the most 
recent Mquant (previous Mq) plus the difference of a new 
average Mquant minus the previous average Mquant. The 
new average Mquant comprises an average Mquant aver- 
aged over the history of the pictures of the same picture 
types in the current scene up to and including the current 
picture to be encoded by the second encoding subsystem. 
The previous average Mquant is the average Mquant over 
the pictures in the scene of the same picture type prior to the 
current picture. The same difference quantity is employed as 
the offset if the output quality from the first encoding 
subsystem is less than the predefined threshold. Only in this 
case, the difference quantity is subtracted from the previous 
Mquant 650 in order to lower the Mquant to be used by the 
second encoding subsystem and thereby increase picture 
quality. As shown in FIG. 9, the next or current Mquant is 
buffered 660 for use by the second encoding subsystem E2. 
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When a current picture is the start of a new scene, the old 
history is not applicable. Therefore, a new set of control 
parameters must be ascertained based on the current picture 
complexity. This new set of control parameters is then 

5 applied to the first encoding subsystem El to generate 
statistics which are collected and refined based on output 
quality. Control parameters for the second picture of the 
scene within the encoding subsystem El are therefore 
adjusted upward or downward according to the output of the 

1Q first picture in the scene. The output statistics of pictures 1 
and 2 are weight averaged to form the starting history for the 
new scene, and the control parameters of picture 1 are 
derived as described above for the second encoding sub- 
system E2, 

The Mquant of each picture type following a scene 

15 change is based on the Mquant of the I picture following the 
scene change. For example, the initial Mquant for a P picture 
can be defined as 1.2 _Mquant (I) and the initial Mquant for 
a B picture can be 2 _M quant Mquant (I). Once the Mquant 
values per picture type have been initialized, the subsequent 

20 Mquant values are based on the average of the previous 
Mquant of the same picture type and the complexity of the 
current picture. A history of Mquants is collected per picture 
type and is refined until a scene change occurs, where the 
Mquants are re-initialized. 

25 As noted, a synchronously delayed version of the video 
encoding sequence (input data) is provided to the second 
encoding subsystem E2 where the actual encoding is per- 
formed. The delay can vary from as little as one frame to as 
many frames as the system can store in memory. For 

30 applications where statistical analysis across a large number 
of frames is desired, buffering of many video frames may 
become impractical. However, by using two video sources 
and delaying the start of the source feeding the second 
encoding subsystem E2, a greater frame delay can be 

35 realized without the addition of a large amount of memory. 
With these techniques, real-time data can be encoded by 
appropriate adjustment of delayed input data and the statis- 
tics. In a final step, the second encoding subsystem E2 
accepts the encoding parameters from the controlling pro- 

40 cessor and applies them to the current picture to produce a 
high quality, low bit rate MPEG-2 variable bit rate (VBR) 
compressed video stream. Although the above invention is 
described for a VBR application, those skilled in the art will 
realize that the techniques described can be extended to 

45 improve the quality of constant bit rate (CBR) encoding. 
This can be done by tuning the statistical analysis and 
encoding parameters for CBR applications. 

Those skilled in the art will note from the above discus- 
sion that encoding in accordance with the principles of the 

50 present invention results in improved picture quality com- 
pared with non-adaptive encoder systems, especially at low 
bit rates. This is because employing adaptive bit allocation 
among frames, as well as within frames, is more critical in 
low bit rate encoding compared with higher bit rate encod- 

55 ing. Further, the encoding technique of this invention can 
insure a semi-constant picture quality of a encoded/decoded 
video sequence in constant bit rate (CBR) mode or a 
constant picture quality in variable bit rate (VBR) encoding 
mode, 

60 In addition, the present invention can be included, for 
example, in an article of manufacture (e.g., one or more 
computer program products) having, for instance, computer 
usable media. The media has embodied therein, for instance, 
computer readable program code means for providing and 

65 facilitating the capabilities of the present invention. The 
articles manufactured can be included as part of the com- 
puter system or sold separately. 
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The flow diagrams depicted herein are provided by way of if said picture quality is greater than said threshold, said 

example. There may be variations to these diagrams or the quantization parameter is defined as the previous quantiza- 

steps or operations described herein without departing from tion parameter of the prior frame plus the difference between 

the spirit of the invention. For instance, in certain cases the the new average quantization parameter and the previous 

steps may be performed in differing order, or steps may be 5 average quantization parameter. 

added, deleted or modified. All these variations are consid- 7. The method of claim 5, further comprising buffering 

ered to comprise part of the present invention as recited in said value for said at least one controllable parameter and 

the appended claims. providing said encoding (c) with the sequence of video 

While the invention has been described in detail herein in frames in synchronous delay to said buffering to insure 

accordance with certain preferred embodiments thereof, 10 concurrent presence of a current frame and its corresponding 

many modifications and changes therein may be affected by value for said at least one controllable parameter at said 

those skilled in the art. Accordingly, it is intended by the second encoding subsystem. 

appended claims to cover all such modifications and changes 8. The method of claim 1, wherein said automatically 

as fall within the true spirit and scope of the invention. processing (b) comprises automatically determining said 

We claim: 15 value for said at least one controllable parameter for a 

1. A method for encoding a sequence of video frame current frame of the sequence of video frames using statis- 
comprising: tics on a prior encoded frame and a history of all prior 

(a) employing a first encoding subsystem to analyze the encoded frames of the same picture type in a current scene 
sequence of video frames and derive information on at containing the current frame to be encoded. 

least one characteristic thereof, said at least one char- 20 9. The method of claim 1, wherein said employing (a) 

acteristic comprising at least one of scene change, comprises deriving information on the sequence of video 

picture quality, bits used, target bit rate, and picture frames from at least one of luminance data and chrominance 

type; data of the sequence of video frames. 

(b) automatically processing said at least one character- 10* The method of claim 1, further comprising delaying 
istic to produce a value for at least one controllable 2 s lhe sequence of vide o frames input to said encoding (c) and 
parameter used in encoding the sequence of video controlling timing of said employing (a), automatically 
frames; and processing (b) and encoding (c) so that for each frame of the 

(c) encoding the sequence of video frames employing a sequence of video frames said employing (a) and automati- 
second encoding subsystem and using said value of call y processing (b) precede said encoding (c) to insure 
said at least one controllable parameter to produce a 30 presence of its corresponding value of said at least one 
bitstream of encoded video data. controllable parameter at said second encoding subsystem 

2. The method of claim 1, further comprising implement- employed in said encoding (c). 

ing said employing (a), automatically processing (b), and 11 11ie method of claim 1, wherein said at least one 

encoding (c) in real time. controllable parameter employed by said encoding (c) com- 

3. The method of claim 1, wherein said at least one 35 P rises . at least one of a quantization parameter, a bit 
characteristic comprises scene change and said automati- allocation, or an encoding mode. 

cally processing (b) comprises automatically evaluating 12 A svstem for encoding a sequence of video frames 

each frame for scene change, and upon detection of scene comprising: 

change disregarding information from previous frames of a fi ret encoding subsystem to analyze the sequence of 

the sequence of video frames in determining said value for 40 video frames and derive information on at least one 

said at least one controllable parameter used in encoding the characteristic thereof, said at least one characteristic 

sequence of video frames. comprising at least one of scene change, picture quality, 

4. The method of claim 3, wherein said at least one bits used, target bit rate, and picture type; 
characteristic further comprises picture quality, and wherein a control processor coupled to said first encoding sub- 
said automatically processing (b) comprises for each frame 45 system for automatically processing in real time said 
automatically comparing picture quality to a predefined information on said at least one characteristic to pro- 
threshold if scene change is undetected for said frame, duce a value for at least one controllable parameter 

5. The method of claim 4, wherein said automatically used in encoding the sequence of video frames; and 
processing (b) further comprises setting said value for said a second encoding subsystem coupled to said control 
at least one controllable parameter such that a number of bits 50 processor to receive said value of said at least one 
for encoding (c) each frame is increased if said picture controllable parameter, wherein said second encoding 
quality is less than said predefined threshold and is subsystem encodes the sequence of video frames using 
decreased if said picture quality is greater than said pre- said value of said at least one controllable parameter to 
defined threshold. produce a bitstream of encoded video data. 

6. The method of claim 5, wherein said at least one 55 13. The system of claim 12, wherein said bitstream of 
controllable parameter comprises a quantization parameter, encoded video date produced by said second encoding 
and wherein for each frame if said picture quality is less than subsystem comprises a variable bit rate bitstream or a 
said predefined threshold, said quantization parameter is constant bit rate bitstream. 

defined as a previous quantization parameter of a prior frame 14. The system of claim 12, wherein said first encoding 

of the sequence of video frames less a difference between a 60 subsystem, said control processor and said second encoding 

new average quantization parameter and a prior average subsystem each operate in real time, 

quantization parameter, wherein said new average quanti- 15. The system of claim 12, wherein said at least one 

zation parameter comprises an average quantization param- characteristic comprises scene change and said control pro- 

eter for all frames of a current scene including the current cessor comprises means for automatically monitoring each 

frame and said previous average quantization parameter 65 frame of the sequence of video frames for scene change, and 

comprises an average quantization parameter for all frames upon detection of scene change disregarding information 

of the current scene prior to the current frame, and wherein from one or more previous frames of the sequence of video 
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frames in determining said value for said at least one 
controllable parameter. 

16. The system of claim 15, wherein said at least one 
characteristic further comprises picture quality, and wherein 
said control processor comprises means for automatically 
comparing picture quality of each frame to a predefined 
threshold if scene change is undetected at said frame. 

17. The system of claim 16, wherein said control proces- 
sor further comprises means for automatically setting said 
value for said at least one controllable parameter such that 
a number of bits used by said second encoding subsystem in 
encoding said frame is increased if said picture quality is less 
than said predefined threshold and is decreased if said 
picture quality is greater than said predefined threshold. 

18. The system of claim 17, wherein said at least one 
controllable parameter comprises a quantization parameter, 
and wherein for each frame if said picture quality is less than 
said predefined threshold, said control processor comprises 
means for defining said quantization parameter as a previous 
quantization parameter of a prior frame of the sequence of 
video frames less a difference between a new average 
quantization parameter and a prior average quantization 
parameter, wherein said new average quantization parameter 
comprises an average quantization parameter for all frames 
of a current scene including the current frame, and said 
previous average quantization parameter comprises an aver- 
age quantization parameter for all frames of the current 
scene prior to the current frame, and wherein if said picture 
quality is greater than said threshold, said control processor 
comprises means for defining said quantization parameter as 
the previous quantization parameter of the prior frame plus 
the difference between the new average quantization param- 
eter and the prior average quantization parameter. 

19. The system of claim 17, further comprising means for 
buffering said value for said at least one controllable param- 
eter and means for providing said sequence of video frames 
to said second encoding subsystem in synchronous delay 
with said buffering to insure concurrent presence of a current 
frame and its corresponding value for said at least one 
controllable parameter at said second encoding subsystem. 

20. The system of claim 12, wherein said control proces- 
sor comprises means for automatically determining said 
value for said at least one controllable parameter for a 
current frame of the sequence of video frames using statis- 
tics on a prior encoded frame and a history of all prior 
encoded frames of a same picture type in a current scene 
containing the current frame to be encoded. 

21. The system of claim 12, wherein said at least one 
controllable parameter employed by said second encoding 
subsystem comprises at least one of a quantization 
parameter, a bit allocation, or an encoding mode. 

22. Computer program product comprising a compute 
usable medium having computer readable program code 
means therein for use in encoding a sequence of video 
frames, said computer readable program code means in said 
computer program product comprising: 

computer readable program code means for causing a 
computer to affect analyzing of the sequence of video 
frames to derive information on at least one character- 
istic thereof, said at least one characteristic comprising 
at least one of scene change, picture quality, bits used, 
target bit rate, and picture type; 

computer readable program code means for causing a 
computer to affect automatically processing of the at 
least one characteristic to produce a value for at least 
one controllable parameter used in encoding the 
sequence of video frames; and 
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computer readable program code means for causing a 
computer to affect encoding the sequence of video 
frames using said value of said at least one controllable 
parameter to produce a bitstream of encoded video 
5 data. 

23. The computer readable program code means of claim 

22, wherein said at least one characteristic comprises scene 
change and said computer readable program code means for 

!0 causing a computer to affect automatically processing com- 
prises computer readable program code means for causing a 
computer to affect automatically monitoring each frame for 
scene change, and upon detection of scene change disre- 
garding information from previous frames of the sequence 

35 of video frames in determining said value for said at least 
one controllable parameter. 

24. The computer readable program code means of claim 

23, wherein said at least one characteristic further comprises 
20 picture quality, and wherein said computer readable program 

code means for causing a computer to affect automatically 
processing comprises computer readable program code 
means for causing a computer to affect automatically com- 
paring picture quality of each frame to a predefined thresh- 
25 old if scene change is undetected for said frame. 

25. The computer readable program code means of claim 

24, wherein said computer readable program code means for 
causing a computer to affect automatically processing fur- 

30 ther comprises computer readable program code means for 
causing a computer to affect setting said value for said at 
least one controllable parameter such that a number of bits 
used in encoding said frame is increased if said picture 
quality is less than said predefined threshold and is 

35 decreased if said picture quality is greater than said pre- 
defined threshold. 

26. The computer readable program code means of claim 

25, wherein said at least one contro liable parameter coqj- 
4Q prises a quantization parameter, and wherein for each frame 

if said picture quality is less than said predefined threshold, 
said quantization parameter is defined by said computer 
readable program code means for causing a computer to 
affect automatically processing as a previous quantization 

45 parameter of a prior frame of the sequence of video frames 
less a difference between a new average quantization param- 
eter and a prior average quantization parameter, wherein 
said new average quantization parameter comprises an aver- 
age quantization parameter for all frames of a current scene 

50 including the current frame and said previous average quan- 
tization parameter comprises an average quantization 
parameter for all frames of the current scene prior to the 
current frame, and wherein if said picture quality is greater 
than said threshold, said quantization parameter is defined as 
the previous quantization parameter of the prior frame plus 
the difference between the new average quantization param- 
eter and the previous average quantization parameter. 

27. The computer readable program code means of claim 
60 25, further comprising computer readable program code 

means for buffering said value for said at least one control- 
lable parameter and providing said computer readable pro- 
gram code means for causing a computer to affect encoding 
with the sequence of video frames in synchronous delay to 
65 said buffering to insure concurrent availability of a current 
frame and its corresponding value for said at least one 
controllable parameter. 
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28. The computer readable program code means of claim 
22, wherein said computer readable program code means for 
causing a computer to affect automatically processing com- 
prises computer readable program code means for causing a 
computer to affect automatically determining said value for 
said at least one controllable parameter for a current frame 
of the sequence of video frames using statistics on a prior 
encoded frame and a history of all prior encoded frames of 
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a same picture type in a current scene containing the current 
frame to be encoded. 

29. The computer readable program code means of claim 
22, wherein said at least one controllable parameter com- 
prises at least one of bit allocation, a quant zation parameter, 
or an encoding mode. 
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